METHODS FOR DETERMINING COAT COLOUR GENOTYPES IN PIGS 

The present invention relates to methods for determining coat colour genotype
in pigs. In particular, it relates to methods of distinguishing between the alleles
I, I^P, I* and i of the KIT gene.

Coat colour is important to the pig breeding industry for a number of reasons.
Firstly, in a number of markets there is a preference for white skinned meat.
This is because pork is often marketed with the skin still attached,
and skins from coloured pigs, even if dehaired, can still exhibit coloured hair
roots, which can lead to negative perception by the consumer, since the surface
of the meat may appear to be spotted by mould. It is therefore necessary in
these markets to remove the skin from such carcasses, entailing additional cost.
For example, in the US, coloured carcasses are associated with approximately
1% of skin defects requiring dehairing and skinning to remove pigment. As a
result of this, coloured pig carcasses are generally discounted. Secondly, gross
variation in the appearance of pigs claimed to be genetically consistent for other
traits can lead to questions about the consistency and quality of the animals in
the mind of pig-producing customers. Breeders would also like to be able to
ensure consistency in breeding populations. Thus breeders may wish to ensure
that progeny produced by breeding crosses are always white. Alternatively, a
breeder producing a coloured breed may wish to ensure that the correct coat
colour characteristics are maintained even during the introgression of genes
from white lines.

White coat colour in pigs is controlled by the dominant white locus designated I
(for inhibition of coat colour). Structural alterations in the porcine KIT gene
have recently been correlated with various alleles of I and are probably
responsible for the differences in coat colour pattern found or are closely linked
to the mutations that are found (Johansson Moller et al. 1996 Mammalian
Genome 7, 822-830). Four structural versions of the porcine KIT gene have
been identified to date and designated I, I^P, I* and i (see figure 1 and table 1).
The version found in fully coloured animals including wild boar, and which is
therefore generally accepted as the wild type allele, is i. The other known
versions of the gene all involve duplication of at least part of the porcine KIT gene.
I^P is a partially dominant allele and causes in the heterozygous state (I^P/i) a
phenotype "patch" characterised by patches of white and coloured coat. I and I*
are both fully dominant and cause a white phenotype both in the heterozygous
and homozygous state. The only difference between I and I* is that there is a
4bp deletion in intron 18 in one of the two KIT gene copies associated with the
I allele. This sequence polymorphism is not expected to have any functional
effect.

The phenotypes of a number of gene combinations are listed below, with the I
allele being dominant over I^P and i, and I^P being dominant over i (Johansson et
al. 1992 Genomics 14, 965-969).



Genotype    Colour

I/I         White
I/I^P       White
I/i         White
I^P/i       Patched
i/i         Coloured



White 



Previously Johansson Moller et al. (1996 Mammalian Genome 7, 822-830)
revealed that KIT occurs as a single copy gene in coloured (i/i) animals but is
duplicated in the I, I* and I^P alleles. This duplication allowed the differentiation
of I, I^P and I* from i through examination of the gene copy number of KIT and
the use of linked polymorphisms in or in the near vicinity of the KIT gene to
distinguish different alleles at the I locus. This approach is the subject of
International patent WO97/05278.

A problem remaining from this method however is that it does not distinguish
between I* and I^P. The result of this is that if one wishes to use the screen to
remove heterozygous carrier animals of genotype I/I^P from a white pig line one
also excludes I/I* animals unnecessarily. The unnecessary removal of animals,
potentially very valuable if at the top of a breeding pyramid, can lead to a
reduction in the rate of improvement of other traits within that particular group
of animals. Other consequences include a general loss of genetic diversity and a
loss of alleles at the locus in question that may have as yet undetected value. It
is essential that one only excludes alleles where absolutely necessary.

Further sequence analysis of genomic DNA revealed a mutation at the first
nucleotide in intron 17 of KIT2 leading to a defect in the splicing of RNA
transcribed from this particular copy of the KIT gene. In experiments involving
several pig breeds, this mutation showed a complete concordance with the
presence of the I and I* alleles but was not found in the i and I^P alleles.

The majority of eukaryotic genes are composed of both coding (exon) and
intervening noncoding (intron) regions. The latter sequences are
removed from large pre-mRNA by a highly accurate cleavage and ligation
reaction known as splicing. The result is a mature mRNA transcript, devoid of
intron sequences, which is transported to the cytoplasm for translation. The
splicing of eukaryotic genes is most likely a two step procedure. First, the
pre-mRNA is cleaved at the 5' (donor) splice site followed by cleavage at the 3'
(acceptor) splice site. The second step involves rejoining of the spliced exons to
result in exclusion of the intervening intron. Splicing is therefore critically
dependent on the accuracy of the cleavage and ligation reactions. This accuracy
appears to be dependent on the almost completely invariant GT and AG
dinucleotides present at the 5' and 3' exon/intron boundaries respectively.
These dinucleotides and the often highly conserved surrounding sequences are
known as splice sites and serve to bind the protein factors required to perform
the cleavage and ligation reactions.
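
The role of the invariant dinucleotides can be illustrated with a minimal sketch that checks whether an intron sequence begins with GT and ends with AG. The function names and the short example sequences are illustrative assumptions only; they are not porcine KIT sequence.

```python
def has_canonical_splice_sites(intron: str) -> bool:
    """Return True if the intron starts with GT (donor) and ends with AG (acceptor)."""
    seq = intron.upper()
    # An intron must be at least long enough to hold both dinucleotides.
    return len(seq) >= 4 and seq.startswith("GT") and seq.endswith("AG")


def donor_site_is_mutated(intron: str) -> bool:
    """Return True if the first two intron bases are not the conserved GT."""
    return not intron.upper().startswith("GT")


# Hypothetical intron sequences, for illustration only.
normal_intron = "GTGAGTCCTTTCAG"
mutant_intron = "ATGAGTCCTTTCAG"  # G at intron position 1 replaced by A

print(has_canonical_splice_sites(normal_intron))
print(donor_site_is_mutated(mutant_intron))
```

A check of this kind only flags loss of the conserved dinucleotides; it says nothing about the surrounding consensus sequence, which also contributes to splice site recognition.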

The consequences of mutation occurring within a splice site may be a
reduction in the amount of mature mRNA produced and/or utilization of
alternative but incorrect splice sites in the vicinity. The result is production of
mRNA which either contains additional intron sequence or which may lack a
portion of coding sequence. Where mutation occurs within the 5' (donor) splice
site and prevents binding of protein factors, the exon is no longer recognised as
such and is excised along with its neighbouring introns. This is referred to as
'exon skipping' (as reviewed by Cooper and Krawczak, Human Gene
Mutation, Bios Scientific Publishers, Oxford, UK, 1994).

In the mutation identified here the G of the conserved GT pair at the
exon 17/intron 17 boundary region is altered to an A. That there is an alteration
in the messenger RNA that is translated into protein and that the presence of the
change correlates with the white/patched phenotypes suggests that this is the
functional mutation rather than merely a linked marker. The splice variant is
expected to give rise to a defective protein as 41 amino acids in the mature
protein are missing. We therefore assume that this splice mutation is the causal
mutation for the difference between I and I^P. Our current knowledge as regards
molecular differences between alleles at the I locus is summarised in Table 1. In
conclusion, we have identified two functionally important mutations. One is the
gene duplication present in I^P, I* and I which by itself appears to cause the
patch phenotype. The exact reason for this phenotypic effect is not known but
one can speculate on the basis of comparative data from the mouse that the
duplicated copy of the KIT gene may lead to a defect in gene expression which
in turn affects melanocyte migration. This is especially valuable in that there
can be no breakdown of the linkage between the DNA polymorphism and the
trait itself. This has allowed us to develop a range of assays for the
determination of the presence of the DNA polymorphism and the genotype of
the animal with regard to coat colour determination to a significantly greater
extent than has been possible before. The second mutation is the splice mutation
that occurs in one of the KIT copies associated with the dominant white (I and
I*) alleles. The expression of a truncated form of the KIT protein is expected to
cause a more severe defect in KIT function and a more severe effect on coat
colour.

Table 1.

Allele   KIT gene   Intron 17   Associated Phenotype

i        KIT1       Normal      Coloured
I^P      KIT1       Normal      Patch
         KIT2       Normal
I        KIT1       Normal      Dominant White
         KIT2       Mutated
I*       KIT1       Normal      Dominant White
         KIT2       Mutated

In addition to these two functionally important mutations, a mutation in the KIT
gene with no known phenotypic effect, the 4 bp deletion in intron 18, has been
documented at the I locus as described in International patent application No.
PCT/GB96/01794.

Thus, in a first aspect the present invention provides a method for determining
coat colour genotype in a pig which comprises:

(a) obtaining a sample of pig nucleic acid; and

(b) analysing the nucleic acid obtained in (a) to determine whether a mutation
is/is not present at one or more exon/intron splice sites of the KIT gene.

In particular, the method determines whether a mutation is/is not present at the
exon 17/intron 17 boundary, eg the substitution of the G of the conserved GT
pair by an A.

Reverse Transcriptase based Polymerase Chain Reaction analysis (RT-PCR) of the
exon 16-19 region of KIT mRNA in animals of the Large White breed (I/I) and
comparison to that transcribed in Hampshire animals (i/i) revealed an extra
species of molecule in the former animals. RT-PCR analysis of both breeds
yielded a product fragment with a length of 424bp indicating that both types of
animal contained a KIT mRNA transcript containing a region corresponding to
this region of the gene. However in addition the I/I animals yielded a RT-PCR
product of 301 bp indicating the presence of an mRNA species which did not
contain the full transcription of the exon 16-19 DNA sequence (see example 1).
These two transcripts have been shown to be derived from the separate
duplicate copies of the KIT gene associated with the I allele. Sequencing over
the KIT exon 17/intron 17 boundary revealed a difference in the intron
boundary sequences present in the two duplicate copies of the KIT gene
associated with the I allele (see example 2). The sequences are as shown
below:






Allele   KIT gene   Exon 17   Intron 17

i        KIT1       ....AAC   GTG
I^P      KIT1       ....AAC   GTG
I^P      KIT2       ....AAC   GTG
I        KIT1       ....AAC   GTG
I        KIT2       ....AAC   ATG



This alteration of the 5' intron splice site from a GT pair to an AT pair affects
the splicing of the pre-mRNA and results in the loss of the whole of exon 17
from the mRNA transcribed from the I-KIT2 sequence. This will result in a
modified KIT protein with the associated alterations in function and phenotype.

Based upon the sequence polymorphism rapid tests can be developed to
determine the alleles carried by a specific animal at the dominant white locus.

In one form such a test would comprise amplification of the region through the
polymerase chain reaction (PCR) utilising genomic DNA from the animal in
question as a template. The nucleotide sequence CATG comprises a recognition
sequence for the restriction enzyme NlaIII. This sequence is only present in the
I-KIT2 sequence at the junction position and thus one can differentiate the
DNA molecules amplified from the two alleles by digestion of the amplification
products with this enzyme or any other restriction enzyme with a suitable
recognition site.

Genomic DNA for use in such a test can be prepared by a wide range of
available methods, from any tissues or products derived from the animal in
question. A 175bp fragment of the KIT gene containing the exon 17/intron 17
boundary region can be amplified from porcine genomic DNA using a pair of
primers such as:

KIT21 (5'-GTA TTC ACA GAG ACT TGG CGG C-3'); and

KIT35 (5'-AAA CCT GCA AGG AAA ATC CTT CAC GG-3').

The use of such primers in a PCR-RFLP test yields a fragment of 175bp before
digestion with NlaIII. In alleles with the G present at nucleotide position 1 of
intron 17 (I-KIT1 type sequence) there is only one NlaIII cleavage site present,
41bp from the end of the fragment. This site is present in all versions of the
KIT sequence. Thus digestion of the 175bp fragments obtained from i and I^P
alleles yields two fragments of 134bp and 41bp. Where the G is mutated to A
as in the I-KIT2 sequence present in I and I* a further NlaIII cleavage site is
created and thus digestion yields products of 80bp and 54bp and 41bp (see
figure 3, example 2). A number of other oligonucleotides suitable as PCR
primers could easily be derived from the sequence of this region of the porcine
genome. An example of such a PCR-RFLP test is given in example 3. The
results that would be obtained from such a PCR-RFLP test described above are
as shown below:



Genotype    Fragment sizes   Fragment sizes
            134 + 41 bp      80 + 54 + 41 bp

I/I         Yes              Yes
I/I^P       Yes              Yes
I/i         Yes              Yes
I^P/I^P     Yes              No
I^P/i       Yes              No
i/i         Yes              No
I*/I        Yes              Yes
I*/I*       Yes              Yes
i/I*        Yes              Yes
I*/I^P      Yes              Yes
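
The fragment sizes quoted for the PCR-RFLP test can be reproduced with a short sketch. The cut coordinates below (134 for the constant site, 80 for the site created by the G to A substitution) are assumptions chosen only so that the 175bp amplicon yields the stated fragment lengths; the orientation of the 80bp and 54bp fragments within the amplicon is not given here, and the function names are hypothetical.

```python
AMPLICON_BP = 175
NORMAL_CUTS = [134]      # single site present in all KIT sequences (assumed coordinate)
MUTANT_CUTS = [80, 134]  # extra site created by the splice mutation (assumed coordinate)


def fragment_sizes(amplicon_length, cut_positions):
    """Return fragment lengths after cutting a linear amplicon at the given positions."""
    bounds = [0] + sorted(cut_positions) + [amplicon_length]
    return [end - start for start, end in zip(bounds, bounds[1:])]


def expected_bands(has_normal_copy, has_mutant_copy):
    """Return the distinct band sizes expected on a gel, largest first."""
    bands = set()
    if has_normal_copy:
        bands.update(fragment_sizes(AMPLICON_BP, NORMAL_CUTS))
    if has_mutant_copy:
        bands.update(fragment_sizes(AMPLICON_BP, MUTANT_CUTS))
    return sorted(bands, reverse=True)


print(fragment_sizes(AMPLICON_BP, NORMAL_CUTS))   # normal sequence
print(fragment_sizes(AMPLICON_BP, MUTANT_CUTS))   # splice mutant sequence
print(expected_bands(True, True))                  # animal carrying both sequence types
```

Because every allele carries at least one normal KIT copy, the 134bp band is always present, which is why band presence alone cannot separate all genotypes.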



Thus, simply by analysing restriction products certain genotypes can be
distinguished. However, there are others which cannot be distinguished in this
first configuration of the test. In a further refinement the test can be carried out
in such a way that the amount of each fragment can be calculated. By carrying
out electrophoresis on an apparatus that allows the quantification of each of the
bands one can determine the ratio of the two forms of template in the genomic
DNA sample used. Examples of such an apparatus include the Perkin Elmer
Applied Biosystems 373 and 377 DNA sequencing systems. The application of
this type of equipment is illustrated in example 4. Any equipment capable of
determining the relative amounts of the products from the two different
sequences is equally applicable to such tests. The expected results from such a
test are shown below.



Genotype    Normal KIT        Splice mutant KIT   Ratio Normal :
            sequence copies   copies              Splice mutant

I/I         2                 2                   1
I/I^P       3                 1                   3
I/I*        2                 2                   1
I/i         2                 1                   2
I^P/I^P     4                 0                   0
I^P/I*      3                 1                   3
I^P/i       3                 0                   0
I*/i        2                 1                   2
I*/I*       2                 2                   1
i/i         2                 0                   0
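
The copy numbers in the table follow from the per-allele composition given in Table 1, and can be derived with a minimal sketch. The dictionary and function names are illustrative assumptions. Where a genotype carries no splice mutant copy the ratio is mathematically undefined, so the sketch returns None for those genotypes; the table above lists them as 0.

```python
from fractions import Fraction

# (normal KIT copies, splice mutant KIT copies) carried by each allele, per Table 1.
ALLELE_COPIES = {
    "i": (1, 0),    # single KIT1 copy
    "I^P": (2, 0),  # duplication, both copies normal
    "I": (1, 1),    # duplication, KIT2 carries the splice mutation
    "I*": (1, 1),   # as I at the splice site
}


def copy_counts(genotype):
    """Return (normal, mutant) copy numbers for a genotype such as 'I/I^P'."""
    first, second = genotype.split("/")
    normal = ALLELE_COPIES[first][0] + ALLELE_COPIES[second][0]
    mutant = ALLELE_COPIES[first][1] + ALLELE_COPIES[second][1]
    return normal, mutant


def normal_to_mutant_ratio(genotype):
    """Return the normal : splice mutant ratio, or None when no mutant copy exists."""
    normal, mutant = copy_counts(genotype)
    if mutant == 0:
        return None
    return Fraction(normal, mutant)


for genotype in ["I/I", "I/I^P", "I/I*", "I/i", "I^P/I*", "I*/i", "I^P/I^P", "i/i"]:
    print(genotype, copy_counts(genotype), normal_to_mutant_ratio(genotype))
```

Representing the ratio as a fraction keeps the expected values exact, which makes it easier to compare them with measured band intensities that will only approximate 1, 2 or 3.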



Thus, using such a test one could identify all animals carrying alleles of
dominant white that might dispose themselves or their offspring to exhibiting
non-white coat colour (i or I^P) as those giving a ratio other than one. Depending
on the requirement and the derivation of the lines under selection one could take
an appropriate subset of animals. For example in a cross derived from animals
carrying only alleles I and i one could identify any white individuals (I/I or I/i)
carrying i as they would have a ratio in the test of 2 as opposed to 1 for the I/I
animals. The distinction of alleles I and I* can be carried out on the basis of the
4bp deletion as described in the previously filed patent publication
WO97/05278.

There are a range of techniques by which alleles containing the splice mutation
and those containing the normal sequence could be differentiated by a person
expert in the field.

Analysis of the genetic composition of an animal could be based upon a number
of different source materials. These include genomic DNA, RNA and the KIT
protein itself. There may also be effects on the levels and nature of other
proteins, metabolites and RNA species which could be measured to create a
more indirect assay.

DNA could be used as the basis for a number of approaches to testing. One
approach is through the amplification of the region of DNA containing the
polymorphism using the polymerase chain reaction. This could then be linked to
a number of forms of analysis of the product. Examples of electrophoresis
based technologies include Single Strand Conformation Polymorphism (SSCP),
Restriction Fragment Length Polymorphism (RFLP) and DNA sequencing of
PCR products or direct genome sequencing. Other PCR based techniques that
might alternatively be used include the Perkin Elmer TaqMan systems, Single
Nucleotide Primer Extension (SNuPE) and Minisequencing. PCR
products from the region might also have application in hybridization based
approaches to the differentiation of alleles at this locus. Hybridization methods
might include the probing of Southern transfers of genomic DNA with allele
specific oligonucleotide, RNA, DNA fragment or Peptide Nucleic Acid (PNA)
probes. Other strategies of application here include hybridization of genomic
DNA or PCR products (specific or whole genome) to oligonucleotide arrays
possibly in the form of 'DNA chips'. Such arrays could consist of any reagent
capable of binding DNA or RNA derived from the locus in question in an allele
specific manner. Further useful methods of analysis also include
oligonucleotide ligation assay and the ligase chain reaction. For a review on
methods for detecting point mutations see Landegren, 1996, Laboratory
Protocols for Mutation Detection, Oxford University Press, Oxford.

A number of effects on the RNA produced from the gene in question have
already been observed, and others may be observed in the future. All such
differences between the mutated and normal forms of the gene are useful targets
for the determination of genotype and a large range of methods are available to
the person skilled in the art. The changes that are or might be observed and
methods of analysis are as follows. Alteration of the size, rate of processing,
stability and quantity of RNA transcripts could be measured through widely used
techniques such as northern blotting and RT-PCR as well as a number of the
techniques described above for DNA analysis such as hybridization to
oligonucleotide or DNA fragment arrays.

Another approach which can be used is to use a linked genetic polymorphism
which is closely associated with the presence or absence of the alteration at the
exon/intron boundary. Such a polymorphism may occur in the KIT gene itself or
in a chromosomal region linked to KIT. By using a single linked marker in
complete association with the presence/absence of the duplication or a
combination of markers showing a partial association a highly informative test can
be developed. For instance, the SSCP (Single Strand Conformation
Polymorphism) method may be used to detect such a polymorphism. The
principle of the method is that double-stranded DNA, produced by PCR, is
denatured into single-stranded DNA which is then separated by non-denaturing
gel electrophoresis. Under non-denaturing conditions the single-stranded DNA
forms a secondary structure due to intra-strand interaction but a proportion of the
single-stranded DNA will renature and form double-stranded DNA. Two types of
polymorphism may be revealed by this method. Firstly, a difference in nucleotide
sequence between two alleles may influence the secondary structure of
single-stranded DNA which is revealed as a difference in the mobility rate during
electrophoresis. Secondly, a difference in nucleotide sequence often influences
the mobility of the heteroduplex DNA (a heteroduplex is a double-stranded DNA
molecule formed by two single-stranded molecules representing different alleles).

Association between genetic markers and genes responsible for a particular trait
can be disrupted by genetic recombination. Thus, the closer the physical distance
between the marker and the gene in question, the less likely it is that
recombination will separate them.

It is also possible to establish linkage between specific alleles of alternative DNA
markers and alleles of DNA markers known to be associated with a particular
gene (e.g. the KIT gene discussed herein), which have previously been shown to
be associated with a particular trait. Thus, in the present situation, taking the KIT
gene, it would be possible, at least in the short term, to select for pigs with a
particular coat colour, indirectly, by selecting for certain alleles of a KIT gene
associated marker through the selection of specific alleles of alternative
chromosome 8 markers. Examples of such markers known to be linked to the KIT
gene on porcine chromosome 8 include genetic polymorphism in the KIT gene
itself or in the closely linked genes for the α-subunit of the platelet derived growth
factor receptor (PDGFRA) and albumin.

Particular genetic markers associated with the KIT gene are microsatellites. These
are simple sequence repeats of 4, 3 or, more usually, 2 nucleotides, which occur
essentially at random around the genome at approximately every 50,000 bases
(about 60,000 microsatellites per haploid genome). Stuttering of DNA
polymerase during replication and unequal crossing-over during recombination
are thought to result in the loss or gain of repeat units. This means that
microsatellites are usually polymorphic and can have several repeat length alleles.

Examples of linked microsatellite sequences include S0086 (Ellegren et al.,
Genomics, 16:431-439 (1993)), S0017 (Coppieters et al., Animal Genetics 24:
163-170 (1993)), SW527, SWR750 and SW916 (Rohrer et al., Genetics, 136:231-
245 (1994)). It would be possible to select indirectly for alleles of the KIT gene
linked to coat colour using any of the above markers, or indeed any other linked
markers on porcine chromosome 8.

Alterations in the level of the KIT protein could be measured using either
specific antibodies, for example in an ELISA system or on western blots, or
through the use of a range of biochemical techniques to measure the activity of
the protein. Such tests could also be applied to other proteins or metabolites,
the level or nature of which is altered by the presence of specific alleles at the
locus. Different protein structures due to the presence of specific alleles could
be identified through the use of structure specific antibodies. As with the DNA
and RNA based methods all these protein methods could be applied in a
quantitative fashion thus achieving the full discriminatory capabilities possible.

Kits could be produced for the specific analysis of the polymorphism described
here alone and also with reagents allowing the combined analysis of the other
polymorphisms previously reported and the subject of patent publication
WO97/05278.

The invention will now be described with reference to the following examples,
which should not be construed as in any way limiting the invention. The
examples refer to the figures in which:



Figure 1: is a schematic representation of the structure of the known
porcine KIT alleles, where 4bp del'n refers to the 4bp deletion in intron
18 of one copy of the duplicated KIT gene DNA as reported by Moller et
al 1996 and Exon 17- A refers to the change of nucleotide 1 of intron 17
from a G as in the wild type allele to an A as reported in this patent;

Figure 2: shows an electropherogram (4% agarose Nusieve/Seakem 3:1;
100V for 80 min) showing RT-PCR products of KIT exon 16-19 with the
primers KIT1F and KIT7R. The samples 1-3 and 4-6 are Swedish Large
White and Hampshire pigs, respectively. The size difference between
the 424 and 301 bp fragments is due to lack of exon 17 in the latter
fraction. The two upper bands of the Yorkshire pigs were interpreted as
heteroduplexes (HD);

Figure 3: shows a 48 bp sequence comprising 21 bp of KIT exon 17 and
27 bp of KIT intron 17 where the position of the intron/exon border is
marked with a vertical line, the splice site mutation (nt1 G to A) is indicated
with a vertical arrow and identical bases in alleles I and i are marked
with a dot;

Figure 4: shows the results of the NlaIII PCR RFLP test used to detect the
presence of a splice site mutation in intron 17 of the KIT gene. Figure
4A shows the position of two NlaIII recognition sites within the PCR
product amplified using primer pair KIT21 and KIT35. All distances are
given in base pairs. Figure 4B shows the size of fragments which result
following NlaIII digestion of either normal KIT or splice mutant KIT.
Figure 4C illustrates use of the PCR RFLP test. Lane 1 shows the
KIT21/KIT35 amplified fragment undigested. Digestion was performed
on PCR products amplified from, in Lane 2: a clone which contains the
splice site mutation; Lane 3: a clone which contains the normal splice
site sequence; Lane 4: genomic DNA from a coloured pig; Lane 5:
genomic DNA from a white pig. Fragment sizes are given in base pairs;

Figure 5: shows a comparison of the ratio of normal to splice mutant
KIT in animals of genotypes I/I, I/i and I/I^P;

Figure 6: shows the ratio values for 56 Landrace and 33 Large White
animals. A clearly bimodal distribution is observed with 7 Landrace and
3 Large White individuals having a ratio value of approximately 3 or
above, suggesting them to be heterozygous carriers for the I^P allele
(genotype I/I^P). This means I^P has gene frequency estimates of 6.25%
(7/112 chromosomes tested) and 4.5% (3/66 chromosomes tested) within
the Landrace and Large White breeds respectively; and

Figure 7: shows a plot of Ct FAM versus Ct TET for animals of
genotypes I/i and I/I analysed for KIT splice mutant genotype using
TaqMan® chemistry.
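
The gene frequency estimates quoted for Figure 6 follow from counting each heterozygous carrier as one I^P chromosome out of two chromosomes per animal tested. A minimal sketch of that arithmetic, with a hypothetical function name, is:

```python
def carrier_allele_frequency(heterozygous_carriers, animals_tested):
    """Estimate allele frequency when every carrier is assumed heterozygous."""
    chromosomes_tested = 2 * animals_tested  # diploid animals
    return heterozygous_carriers / chromosomes_tested


landrace = carrier_allele_frequency(7, 56)     # 7 carriers among 112 chromosomes
large_white = carrier_allele_frequency(3, 33)  # 3 carriers among 66 chromosomes

print(f"Landrace: {landrace:.2%}")
print(f"Large White: {large_white:.2%}")
```

The estimate assumes that no tested animal is homozygous for I^P, which is consistent with the interpretation of the high-ratio animals as I/I^P heterozygotes.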

Example 1

RT-PCR of porcine KIT exon 16-19

i. mRNA purification from blood samples

Fresh blood samples were collected in citrate tubes from coloured Hampshire
pigs and Large White pigs. Leukocytes were isolated from 5 ml blood using
Ficoll 100 (Pharmacia Biotech). Isolation of mRNA from leukocytes was then
carried out using the Quickprep Micro mRNA purification kit (Pharmacia
Biotech). The mRNA was stored as a precipitate under ethanol at -70°C for up
to one month before use in reverse transcriptase (RT)-PCR.

ii. RT-PCR of KIT exon 16-19

First-strand cDNA synthesis was accomplished using the First-Strand cDNA
Synthesis kit (Pharmacia Biotech) so that ~100 ng mRNA was randomly
primed by 0.1 µg pd(N6) in a total volume of 15 µl. Two µl of the completed
first cDNA strand reaction was then directly used per 12 µl PCR reaction by
adding 10 µl PCR mix containing 10 pmol each of the mouse/human derived
primers KIT1F and KIT7R (5'-TCR TAC ATA GAA AGA GAY GTG ACT C
and 5'-AGC CTT CCT TGA TCA TCT TGT AG, respectively; Moller et al.
1996, supra), 1.2 µl 10 x PCR-buffer (10 mM Tris-HCl, pH 8.3, 50 mM KCl)
and 0.5 U of AmpliTaq polymerase (Perkin-Elmer) incubated with an equal
amount of Taqstart antibody (Clontech) at 25°C for 5 min to achieve a hot start
PCR. The reaction was covered with 20 µl mineral oil and thermocycled in a
Hybaid Touchdown machine (Hybaid) with 40 cycles at 94°C for 1 min, 55-48°C
(touchdown one degree per cycle the first seven cycles and then 48°C in the
remaining cycles) for 1 min and 72°C for 1 min. After PCR 2 µl loading dye
was added to each sample, which was then loaded on a 4% agarose gel
(Nusieve/Seakem 3:1, FMC Bioproducts) and electrophoresed at 100 V for 80
min. Products were visualised by ethidium bromide staining and UV-
illumination.
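
The touchdown annealing schedule described above can be written out explicitly. This is a minimal sketch under one reading of the protocol, namely that annealing starts at 55°C, drops by one degree per cycle over the first seven cycles, and is held at 48°C from the eighth cycle onwards; the function name and defaults are illustrative assumptions.

```python
def touchdown_annealing_temps(start_c=55, floor_c=48, total_cycles=40):
    """Return the annealing temperature (degrees C) for each cycle of a touchdown PCR."""
    temps = []
    for cycle_index in range(total_cycles):
        # Drop one degree per cycle until the floor temperature is reached.
        temps.append(max(start_c - cycle_index, floor_c))
    return temps


schedule = touchdown_annealing_temps()
print(schedule[:8])          # first eight cycles
print(schedule.count(48))    # number of cycles annealed at the floor temperature
```

Starting above the final annealing temperature favours specific priming in the early cycles, which is useful here because the primers are derived from mouse and human rather than porcine sequence.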

iii. Cloning and sequencing of RT-PCR products

The RT-PCR products representing KIT exon 16-19 were purified by extraction
from 2% agarose gels using the QIAEX gel extraction kit (QIAGEN) and
cloned into the pUC18 vector using the Sureclone ligation kit (Pharmacia
Biotech). Plasmids were isolated using the QIAFilter plasmid Midi kit
(QIAGEN). Cloned plasmid inserts were sequenced using dye primer
chemistry. Each cycling reaction was prepared with plasmid template DNA and
ready reaction mix containing fluorescently labelled M13 forward or reverse
primer as described in the ABI Prism protocol P/N 402113 (Perkin Elmer).
Cycling and sample pooling were performed using a Catalyst 800 Molecular
Biology Workstation (ABI) following the instrument's user manual (Document
number 903877, Perkin Elmer). The resulting extension products were purified,
loaded and analysed using the 377 ABI Prism sequencer as described by the
instrument protocol P/N 402078 (Perkin Elmer).






WO 99/20795 PCT/GB98/03081 




iv. Results and discussion

A 424 bp fragment including KIT cDNA exon 16-19 was amplified from all
pigs. The Hampshire pigs did not show any additional products whereas the
Large White pigs (eight tested) all showed a 301 bp truncated cDNA fragment
(Fig 2). Sequence analysis revealed the 424 bp fragment was identical in the
two breeds whereas the whole exon 17 (123 bp) was missing from the 301 bp
fragment. Apparent differences between individuals regarding the relative
amounts of these two products may have been caused either by different
genotypes containing differing numbers of copies of the KIT gene sequence,
individual differences in mRNA expression levels or random RT-PCR effects.
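
The product sizes reported here are consistent with the 41 missing amino acids stated earlier in the description, as a short check shows. The function name is a hypothetical label for this arithmetic.

```python
FULL_PRODUCT_BP = 424       # RT-PCR product containing exons 16-19
TRUNCATED_PRODUCT_BP = 301  # RT-PCR product lacking exon 17


def skipped_exon_summary(full_bp, truncated_bp):
    """Return (missing bp, whole codons lost, whether the reading frame is kept)."""
    missing_bp = full_bp - truncated_bp
    codons, remainder = divmod(missing_bp, 3)
    return missing_bp, codons, remainder == 0


print(skipped_exon_summary(FULL_PRODUCT_BP, TRUNCATED_PRODUCT_BP))
```

Because the missing length is a multiple of three, skipping exon 17 removes whole codons without shifting the reading frame of the downstream exons.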

The two upper fragments present in Large White pigs represent heteroduplexes
between the 301 and 424 bp fragments (Fig. 2). This was shown by an
experiment where these slow migrating fragments were generated by pooling
homoduplexes of the 424 and 301 bp which were then heat denatured and
cooled to 25°C. Moreover, cloning of the lower heteroduplex fraction of a Large
White pig resulted in clones with insert length corresponding to either of the
two homoduplexes.

Example 2

PCR Amplification and Sequencing of KIT Exon 17-Intron 17 (5' Splice Site)

i. PCR to produce DNA Sequencing Template

A 175 bp region including the boundary between exon 17 and intron 17 of the
KIT gene was amplified for sequence analysis using forward primer KIT21 (5'-
GTA TTC ACA GAG ACT TGG CGG C-3') and reverse primer KIT35 (5'-
AAA CCT GCA AGG AAA ATC CTT CAC GG-3'). PCR was carried out on
a DNA thermal cycler (Perkin Elmer 9600) in a total volume of 20 µl
containing 25 ng genomic DNA, 1.0 mM MgCl2, 50 mM KCl, 10 mM Tris-
HCl, pH 8.3, 200 µM dNTPs, 0.5 U AmpliTaq Gold (Perkin Elmer) and 10
pmol of both KIT21 and KIT35 primer. To activate AmpliTaq Gold, initial heat
denaturation was carried out at 94°C for 10 minutes followed by 32 cycles each
consisting of 45 sec at 94°C, 45 sec at 55°C and 45 sec at 72°C. The final
extension lasted for 7 min at 72°C. PCR products were cloned into vector
pUC18 using the SureClone ligation kit (Pharmacia Biotech).

10 

ii. Preparation of Plasmid DNA 

Plasmid DNA was purified from overnight bacterial culture using the Jetstar 
plasmid midi kit (Genomed) and the resulting DNA diluted to 150 ng/µl. 

iii. Sequencing of Plasmid DNA 

DNA was sequenced as in example 1 section iii. 

iv. Results 

A portion of the DNA sequence from exon 17 and intron 17 of the KIT gene 
was determined and compared between animals with each of these three alleles. 
Figure 3 shows that the I allele carries a splice site mutation at position 1 of 
intron 17. This G to A base substitution is present in one of the two gene copies 
carried on each chromosome. The base substitution occurs in the invariant GT 
dinucleotide which characterises 5' exon/intron boundaries. Analysis of the I^P 
allele showed the splice site mutation was not present in either the normal 
(KIT1) or duplicated copy of the gene (KIT2). We have found the splice site 
mutation is unique to the I alleles, and it therefore makes it possible to 
distinguish the I-KIT2 sequences. 

Example 3 

Testing For the Presence of the Splice Site Mutation with PCR RFLP 

To easily test for the presence of the G to A splice site mutation, restriction 
endonuclease NlaIII (CATG) was used to exploit the point substitution 
identified at position 1 of intron 17 (Figure 3). The NlaIII recognition sites in 
the fragment amplified from KIT and the expected restriction products are 
illustrated in Figure 4A and 4B respectively. 



i. DNA preparation for RFLP Test 

DNA can be prepared from any source of tissue containing cell nuclei, for 
example white blood cells, hair follicles, ear notches and muscle. The 
procedure here relates to blood cell preparations; other tissues can be processed 
similarly by directly suspending material in K buffer and then proceeding from 
the same stage of the blood procedure. The method outlined here produces a 
cell lysate containing crude DNA which is suitable for PCR amplification. 
However, any method for preparing purified, or crude, DNA should be equally 
effective. 

Blood was collected in 50 mM EDTA pH 8.0 to prevent coagulation. 50 µl of 
blood was dispensed into a small microcentrifuge tube (0.5 ml Eppendorf or 
equivalent). 450 µl of TE buffer was added to lyse the red blood cells (haem 
groups inhibit PCR) and the mix vortexed for 2 seconds. The intact white and 
residual red blood cells were then centrifuged for 12 seconds at 13,000 g in a 
microcentrifuge. The supernatant was removed by gentle aspiration using a low 
pressure vacuum pump system. A further 450 µl of TE buffer was then added to 
lyse the remaining red blood cells and the white blood cells collected by 
centrifugation as before. If any redness remained in the pellet, this process was 
repeated until the pellet was white. After removal of the last drop of 
supernatant from the pelleted white blood cells, 100 µl of K buffer containing 
proteinase K was added and the mixture incubated at 55°C for 2 hours. 
The mixture was then heated to 95-100°C for 8 minutes and the DNA 
lysates stored at -20°C until needed. 

Reagents 

T.E. Buffer:   10 mM Tris-HCl pH 8.0 
               1 mM EDTA 

K Buffer:      50 mM KCl 
               10 mM Tris-HCl pH 8.3 
               2.5 mM MgCl2 
               0.5% Tween 20 

ii. Restriction Enzyme Digestion and Electrophoresis 

The PCR amplification product is 175 bp in length. To test for polymorphism at 
position 1 of intron 17, digestion reactions were set up as below: 

3.0 µl PCR amplified DNA 
1.0 µl 10 X NEBuffer 4 
0.1 µl BSA 100 µg/ml 
0.1 µl NlaIII 10 U/µl 
5.8 µl dH2O 

(1 X NEBuffer 4 (New England Biolabs) contains 50 mM potassium acetate, 20 
mM Tris acetate, 10 mM magnesium acetate and 1 mM DTT). Following 
incubation at 37°C for 90 minutes, each 10 µl reaction volume had 2 µl of 
loading dye added and the mix was loaded on an 8% native polyacrylamide gel 
(Protogel, 37.5:1 acrylamide:bisacrylamide, National Diagnostics, Atlanta) in 
0.5 X TBE (44.5 mM Tris pH 8.0, 44.5 mM boric acid and 0.5 mM EDTA) and 
electrophoresed for 3 hours at 200 V in a vertical slab unit (SE600, Hoefer 
Scientific Instruments). Products were visualised by ethidium bromide staining. 

iii. Results 

A PCR RFLP protocol was designed to test for the presence of the splice site 
mutation, as the substitution occurs within the recognition site for restriction 
endonuclease NlaIII. Figure 4B illustrates that presence of the G to A base 
substitution at position 1 of KIT intron 17 results in restriction at each of two 
NlaIII recognition sites within the 175 bp DNA fragment. Following 
electrophoresis, this results in fragments of 80 bp, 54 bp and 41 bp. Where 
the splice site mutation is absent, however, incubation with NlaIII results in 
digestion only at recognition site 1. Following electrophoresis this results in 
fragments of 134 bp and 41 bp. The invariant NlaIII recognition site 1 serves as 
an internal control to ensure complete digestion has taken place. Results of this 
PCR RFLP analysis are illustrated in Figure 4C. Analysis was performed on 
fragments amplified from clones which either carry the splice site mutation 
(lane 2) or carry the normal splice site sequence (lane 3). Lane 4 shows the 
result of analysis where DNA amplified from the genomic DNA of a coloured 
animal was used. Lane 5 shows the resulting bands where a white animal was 
tested. The test was used to analyse 121 individuals from seven different breeds 
of pig. The splice site mutation was found only in the 97 animals with the 
dominant white phenotype (I/- or I*/i) and in none of the 24 coloured (i/i) 
examples (Table 2). This analysis confirms I and I* to be unique in that they are 
the only alleles to carry the splice site mutation. 
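
The interpretation of a digested lane described above can be sketched in code. The following Python fragment is only an illustration: the helper name `interpret_lane`, the 3 bp sizing tolerance and the returned labels are assumptions, not part of the described protocol. The fragment sizes are those stated in the text. The case in which both patterns appear together is an inference from the statement that the mutation is present in only one of the two gene copies on a chromosome.

```python
# Expected NlaIII fragment sizes (bp) for the 175 bp KIT21/KIT35 amplicon,
# as stated in the text. Site 1 is invariant; site 2 exists only when the
# G to A substitution is present.
AMPLICON_BP = 175
NORMAL_FRAGMENTS = {134, 41}
MUTANT_FRAGMENTS = {80, 54, 41}


def interpret_lane(observed_bp, tolerance=3):
    """Hypothetical helper: classify one gel lane from estimated band sizes.

    `tolerance` (bp) is an assumed allowance for sizing error on the gel.
    """
    def seen(size):
        return any(abs(size - band) <= tolerance for band in observed_bp)

    # The 41 bp band comes from the invariant site and acts as the
    # internal control for complete digestion.
    if seen(AMPLICON_BP) or not seen(41):
        return "incomplete digestion or failed PCR"
    has_normal = seen(134)
    has_mutant = seen(80) and seen(54)
    if has_normal and has_mutant:
        return "normal and splice-mutant KIT copies present"
    if has_normal:
        return "normal KIT only (splice mutation absent)"
    if has_mutant:
        return "splice-mutant KIT only"
    return "uninterpretable"


print(interpret_lane([134, 41]))
print(interpret_lane([135, 80, 55, 40]))
```

Both fragment sets sum to the 175 bp amplicon, which gives a quick consistency check on the sizes quoted in the text.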



TABLE 2 

Distribution of the Splice Site Mutation Between Different Breeds and Coat 
Phenotype 

Breed                     Coat       Assumed      Animals   Normally        Splice 
                          Colour     Genotype 1   Tested    spliced KIT 2   Mutation 2 

Large White               White      I/-          33        33              33 
Landrace                  White      I/-          56        56              56 
Hampshire                 Coloured   i/i          5         5               0 
Duroc                     Coloured   i/i          5         5               0 
Pietrain                  Coloured   i/i          8         8               0 
Meishan                   Coloured   i/i          5         5               0 
Wild Boar                 Coloured   i/i          1         1               0 
Wild Boar x Large White   White      I*/-         8         8               8 

Totals 
                          White      I/-          89        89              89 
                          White      I*/-         8         8               8 
                          Coloured   i/i          24        24              0 

1 White animals may be homozygous or heterozygous for the I allele 

2 Presence of the splice site mutation determined by NlaIII PCR RFLP test 

Example 4 

Quantification of Normal KIT and Splice Mutant KIT (Intron 17 nt1 G→A) 

As the splice site mutation is present in only one of the duplicated regions of I 
and not in the duplicated region of I^P, the various genotypes can be expected to 
have the attributes described in Table 3. 
TABLE 3 

Genotype   Copies of Normal   Copies of KIT           Ratio of normal KIT 
           KIT                containing the splice   to splice mutant KIT 
                              mutation 

I/I        2                  2                       1:1 
I/i        2                  1                       2:1 
i/i        2                  0                       2:0 
I/I^P      3                  1                       3:1 
I^P/i      3                  0                       3:0 

Due to the dominance of allele I, three of the genotypes in Table 3 are carried 
by white animals and therefore cannot be identified by phenotypic 
characterisation. Quantification of the relative amounts of the normal KIT gene 
and the splice mutant KIT gene allows the ratio between the two to be 
calculated, and therefore the genotype of individual animals to be predicted. 
This was achieved by quantification of two DNA fragments following NlaIII 
digestion. 

The amounts of the 134 bp fragment, representative of the normally spliced KIT 
gene, and of the 54 bp fragment, representative of the splice mutant KIT, were 
measured following electrophoresis using GeneScan software. 
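
The expected copy numbers in Table 3 follow from adding up the gene copies contributed by each chromosome. The Python sketch below makes that arithmetic explicit. The per-allele copy counts are inferred from the table rows rather than quoted from the text, and the names `ALLELE_COPIES`, `expected_copies` and `expected_ratio` are illustrative only.

```python
from fractions import Fraction

# Copies contributed by one chromosome, as (normal KIT, splice-mutant KIT).
# Inferred from Table 3: I carries one normal and one mutant copy, I^P
# carries two normal copies, and i carries a single normal copy.
ALLELE_COPIES = {"I": (1, 1), "I^P": (2, 0), "i": (1, 0)}


def expected_copies(allele_a, allele_b):
    """Return (normal, mutant) copy numbers for a diploid genotype."""
    normal = ALLELE_COPIES[allele_a][0] + ALLELE_COPIES[allele_b][0]
    mutant = ALLELE_COPIES[allele_a][1] + ALLELE_COPIES[allele_b][1]
    return normal, mutant


def expected_ratio(allele_a, allele_b):
    """Normal:mutant ratio, or None when no mutant copy is present."""
    normal, mutant = expected_copies(allele_a, allele_b)
    return Fraction(normal, mutant) if mutant else None


for genotype in [("I", "I"), ("I", "i"), ("i", "i"), ("I", "I^P"), ("I^P", "i")]:
    print("/".join(genotype), expected_copies(*genotype), expected_ratio(*genotype))
```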

i. PCR to Produce DNA for Quantification 

As described in example 2 section i. The reverse primer KIT35 is labelled with 
the ABI fluorescent dye FAM at the 5' end. 

ii. Restriction Enzyme Digestion 

As described in example 3 section ii. 

iii. Electrophoresis and Quantification of DNA Fragments 

Following digestion, 0.5 µl of the reaction volume was mixed with 2.5 µl of 
deionised formamide, 0.5 µl of GS350 DNA standard (ABI) and 0.4 µl blue 
dextran solution before being heated to 90°C for 2 minutes and rapidly cooled 
on ice. Three µl of this mix was then loaded onto a 377 ABI Prism sequencer 
and the DNA fragments separated on a 6% polyacrylamide gel in 1 X TBE 
buffer for 2 hours at 700 V, 40 mA, 32 W. The peak areas of fragments 
representative of both the normal and splice mutant forms of KIT were 
quantitated using the GeneScan (ABI) software. 

iv. Ratio Calculations 

The peak area value of the 134 bp fragment (normal KIT) was divided by twice 
the peak area value of the 54 bp fragment (splice mutant KIT) in order to 
calculate the ratio value for each sample. 

v. Results 

Analysis was performed on animals from the Swedish wild pig/Large White 
intercross pedigree for which genotypes at I have been determined by 
conventional breeding experiments with linked markers. Figure 5 and Table 4 
show the ratio of normal to mutant KIT calculated for animals from each of the 
three genotype classes, I/I (expected ratio 1:1), I/i (expected ratio 2:1) and 
I/I^P (expected ratio 3:1). The results are entirely consistent with the expected 
ratio values and indicate that the three genotype classes can be distinguished 
using this method. 
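
As a minimal sketch of the ratio calculation in section iv, the Python fragment below divides the 134 bp peak area by twice the 54 bp peak area, as stated in the text. The assignment of a sample to the nearest expected ratio is an assumed decision rule added for illustration. The text reports only that the classes are distinguishable and does not give a cut-off. The peak areas in the usage line are invented numbers.

```python
# Expected normal:mutant ratios for the three white genotype classes.
EXPECTED_RATIO = {"I/I": 1.0, "I/i": 2.0, "I/I^P": 3.0}


def kit_ratio(area_134bp, area_54bp):
    """Ratio value as described: 134 bp area / (2 x 54 bp area)."""
    if area_54bp <= 0:
        raise ValueError("54 bp peak area must be positive")
    return area_134bp / (2.0 * area_54bp)


def nearest_genotype(ratio):
    """Assumed rule: pick the genotype whose expected ratio is closest."""
    return min(EXPECTED_RATIO, key=lambda g: abs(EXPECTED_RATIO[g] - ratio))


# Hypothetical peak areas, for illustration only.
ratio = kit_ratio(area_134bp=4460.0, area_54bp=1000.0)
print(round(ratio, 2), nearest_genotype(ratio))
```

In practice, thresholds would be better set from the observed distributions for animals of known genotype than from the theoretical values alone.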

TABLE 4 

Ratio of the Two KIT Forms in Different Dominant White Genotypes in a 
Wild Pig/Large White Intercross 

Genotype   Phenotype   Expected Ratio     Observed Ratio           Number 
                       (Normal:Mutant)    (Normal:Mutant) ± SE     Tested 

I/I        White       1:1                1.15 ± 0.075             13 
I/I^P      White       3:1                3.11 ± 0.084             12 
I/i        White       2:1                2.23 ± 0.109             14 



Figure 5 illustrates that the ranges of ratio values calculated for the two 
genotypes I/I and I/I^P do not overlap. This enables animals carrying the I^P 
allele to be identified and the frequency of the allele within different pig 
breeds determined. Ratio values were calculated for 56 Landrace and 33 Large 
White animals and the results are shown in Figure 6. A clearly bimodal 
distribution is observed, with 7 Landrace and 3 Large White individuals having a 
ratio value of approximately 3 or above, suggesting them to be heterozygous 
carriers of the I^P allele (genotype I/I^P). This means I^P has gene frequency 
estimates of 6.25% (7/112 chromosomes tested) and 4.5% (3/66 chromosomes 
tested) within the Landrace and Large White breeds respectively. 
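
The gene frequency estimates above count one carried allele per heterozygous animal and two chromosomes per animal tested. A short Python check of that arithmetic follows. The function name is illustrative, and the calculation assumes, as the text does, that every carrier is heterozygous.

```python
def allele_frequency(heterozygous_carriers, animals_tested):
    """Allele frequency when each carrier holds exactly one copy."""
    chromosomes_tested = 2 * animals_tested
    return heterozygous_carriers / chromosomes_tested


# Counts taken from the text: 7 of 56 Landrace, 3 of 33 Large White.
for breed, carriers, tested in [("Landrace", 7, 56), ("Large White", 3, 33)]:
    freq = allele_frequency(carriers, tested)
    print(f"{breed}: {carriers}/{2 * tested} chromosomes = {freq:.2%}")
```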

Example 5 

Analysis for presence and quantification of the porcine KIT splice mutation 
using the PE ABI TaqMan chemistry 

Method 

i. Preparation of template DNA for PCR 
DNA was prepared as in example 3, section i 

ii. TaqMan® PCR reactions 

TaqMan® PCR reactions were set up as shown in Table 5. 

TABLE 5 

PCR mix for TaqMan® based splice mutation test 

Reagent                                             Final Conc.   Volume 

10x TaqMan® Buffer A (Perkin Elmer)                 1x            2.50 µl 
25 mM MgCl2 solution                                5 mM          5.00 µl 
dATP                                                200 µM        0.50 µl 
dCTP                                                200 µM        0.50 µl 
dGTP                                                200 µM        0.50 µl 
dUTP                                                200 µM        0.50 µl 
AmpliTaq Gold™ (5 U/µl) (Perkin Elmer)              0.05 U/µl     0.25 µl 
AmpErase™ N-Glycosylase (1 U/µl) (Perkin Elmer)     0.01 U/µl     0.25 µl 
KITTM-NEST-F (5 µM)                                 500 nM        2.50 µl 
KITTM-NEST-R (5 µM)                                 500 nM        2.50 µl 
KITTM FAM (5 µM)                                    100 nM        0.50 µl 
KITTM TET (5 µM)                                    100 nM        0.50 µl 
25% Glycerol                                                      8.00 µl 
Porcine genomic DNA                                               1.00 µl 
Total                                                             25.00 µl 
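
The final concentrations in Table 5 can be cross-checked against the stated stock concentrations and volumes using C1 x V1 = C2 x V2. The Python sketch below does this only for reagents whose stock concentration is given in the table. The dictionary layout and the function names are illustrative assumptions.

```python
TOTAL_UL = 25.0

# reagent: (stock concentration, volume in µl, stated final concentration)
# Units per row: mM for MgCl2, U/µl for enzymes, nM for primers and probes.
MIX = {
    "MgCl2": (25.0, 5.00, 5.0),
    "AmpliTaq Gold": (5.0, 0.25, 0.05),
    "AmpErase": (1.0, 0.25, 0.01),
    "KITTM-NEST-F": (5000.0, 2.50, 500.0),
    "KITTM-NEST-R": (5000.0, 2.50, 500.0),
    "KITTM FAM": (5000.0, 0.50, 100.0),
    "KITTM TET": (5000.0, 0.50, 100.0),
}

# Volumes (µl) of the remaining components: buffer, four dNTPs,
# glycerol and genomic DNA.
OTHER_VOLUMES_UL = [2.50, 0.50, 0.50, 0.50, 0.50, 8.00, 1.00]


def final_concentration(stock, volume_ul, total_ul=TOTAL_UL):
    """Dilution formula: C2 = C1 * V1 / V2."""
    return stock * volume_ul / total_ul


def total_volume():
    return sum(v for _, v, _ in MIX.values()) + sum(OTHER_VOLUMES_UL)


for name, (stock, volume, stated) in MIX.items():
    computed = final_concentration(stock, volume)
    print(f"{name}: computed {computed:g}, stated {stated:g}")
print("total volume (µl):", total_volume())
```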



The PCR primers used were as described below: 

KITTM-Nest-F (5'-CTC CTT ACT CAT GGT CGA ATC ACA-3') and 

KITTM-Nest-R (5'-CGG CTA AAA TGC ATG GTA TGG-3'). 

The TaqMan® probes used were: 

KITTM-A FAM (5'-TCA AAG GAA ACA TGA GTA CCC ACG CTC-3') and 

KITTM-G TET (5'-TCA AAG GAA ACG TGA GTA CCC ACG C-3') 

The TaqMan® probes were prepared by Perkin Elmer and labelled with FAM 
and TET as indicated, as well as the standard quenching group TAMRA. The 
10x TaqMan® Buffer A, AmpliTaq Gold™, AmpErase N-Glycosylase, dNTPs 
and 25 mM MgCl2 used were part of the TaqMan® PCR Core Reagent Kit, 
supplied by Perkin Elmer. 

The reactions were then placed into a Perkin Elmer ABI Prism 7700 Sequence 
Detector and the reaction carried out using the following thermal profile: 50°C 
for 2 minutes, 95°C for 10 minutes, followed by 40 cycles of 95°C for 15 s and 
62°C for 60 s. The reactions were carried out under the control of 'Sequence 
Detector V.1.6' software using the 'Single Reporter' and 'Real-Time' options 
with the 'Spectral Compensation' function activated. Upon completion of the 
run, real-time profiles for each sample were examined on the ABI 7700 to check 
for any samples giving highly irregular profiles, which were then excluded. The 
thresholds for both dyes, FAM and TET, were set so that they intercepted each 
dye during the exponential phase of PCR. Following updating of the 
calculations in the 'Sequence Detector V.1.6' software, results were exported 
into MS Excel for further analysis. 
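
The two TaqMan® probes listed above are allele specific because they differ at a single base. The short Python sketch below compares the two probe sequences as given in the text and reports where they differ over their shared length. The function name is illustrative.

```python
# Probe sequences as given in the text, with spaces removed.
PROBE_A_FAM = "TCAAAGGAAACATGAGTACCCACGCTC"  # detects the splice mutant (A)
PROBE_G_TET = "TCAAAGGAAACGTGAGTACCCACGC"    # detects the normal sequence (G)


def mismatches(seq_a, seq_b):
    """List (1-based position, base in seq_a, base in seq_b) over the overlap."""
    return [
        (i + 1, a, b)
        for i, (a, b) in enumerate(zip(seq_a, seq_b))
        if a != b
    ]


print(len(PROBE_A_FAM), len(PROBE_G_TET))
print(mismatches(PROBE_A_FAM, PROBE_G_TET))
```

The single difference falls at probe position 12, immediately after the exon 17 sequence AAA GGA AAC, which corresponds to position 1 of intron 17 in the Figure 3 alignment.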

iii. Analysis of results 

Based upon the underlying theoretical principle that one cycle of PCR gives a 
doubling in the amount of cleavage of the quenching dye from the allele 
specific probe and therefore doubles the signal, one would expect the threshold 
cycle numbers from the II and Ii genotypes analysed to be as below: 

Table 6: 

Theoretical results for TaqMan® analysis of genotype at the KIT splice 
mutation 

Genotype   Copies KIT1   Copies KIT2   Theoretical Ct   Theoretical Ct 
           (G)           (A)           TET (G)          FAM (A) 

II         2             2             X                Y 
Ii         2             1             X                Y+1 



In theory the Ct for TET and FAM signals, represented as X and Y, should be 
the same, as equal numbers of copies of the target sequences should be present 
in an II animal. However, in practice this does not necessarily occur due to 
differences in the hybridization and cleavage efficiency of the two probes and 
variation in the setting of the threshold cycle between the two dye signals. The 
reduction in splice mutant containing (A) sequences relative to those not 
containing the splice mutation (G) in the Ii animals, i.e. a 2:1 G:A ratio rather 
than 1:1 as for the II genotype, should lead to the FAM signal reaching the 
threshold 1 cycle later than the TET signal in the genotype Ii animals. The 
actual results for samples tested are shown in Table 7. 
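
The one-cycle delay follows from the doubling principle: if the signal doubles each cycle, a two-fold difference in starting copies corresponds to one cycle. A minimal Python sketch of this relationship is given below, assuming ideal doubling and equal probe efficiencies. These are the same idealisations the text notes are not fully met in practice.

```python
import math


def expected_delta_ct(copies_g, copies_a):
    """Expected Ct(FAM, A) - Ct(TET, G) under ideal doubling per cycle."""
    return math.log2(copies_g / copies_a)


print("II:", expected_delta_ct(2, 2))  # equal copies, no delay expected
print("Ii:", expected_delta_ct(2, 1))  # half as many A copies, one cycle later
```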

Table 7 

Ct values from analysis of II and Ii genotypes 

Sample        Genotype   Ct FAM (A)   Ct TET (G)   Ct FAM - Ct TET 

1             Ii         24.68        22.59         2.09 
2             Ii         25.98        23.62         2.36 
3             Ii         26.54        25.57         0.97 
4             Ii         27.37        24.78         2.59 
5             Ii         24.94        21.61         3.33 
6             Ii         25.68        22.1          3.58 
                                                   Ii Mean = 2.49 
7             II         22.05        23.78        -1.73 
8             II         24.22        24.59        -0.37 
9             II         24.19        23.85         0.34 
10            II         23.66        23.51         0.15 
11            II         24.35        22.71         1.64 
12            II         22.82        21.69         1.13 
13            II         22.84        22.7          0.14 
14            II         23.17        22.9          0.27 
                                                   Mean = 0.20 
No Template              35           35            0 
No Template              35           35            0 
No Template              35           35            0 
No Template              35           35            0 


Despite variation around the mean values, it can be seen from Table 7 that there 
is a significantly increased delay in the FAM signal reaching the threshold level 
(approximately 2 cycles) relative to the TET signal in Ii animals compared to II 
animals, as predicted, reflecting the reduced number of copies of the splice 
mutant (A) sequence present in animals of the Ii genotype. Plotting of the 
individual samples on a scatter plot (Figure 7) shows clustering of the two 
genotypes, with the Ii cluster shifted along the Ct FAM axis due to the reduced 
number of copies of the KIT2 (A) sequence for which the FAM probe is 
specific. 

CLAIMS: 

1. A method for determining coat colour genotype in a pig which comprises: 

(a) obtaining a sample of pig nucleic acid; and 

(b) analysing the nucleic acid obtained in (a) to determine whether a mutation 
is/is not present at one or more exon/intron splice sites of the KIT gene. 

2. A method as claimed in claim 1 wherein the analysis in step (b) is carried 
out to determine whether a mutation is/is not present at the exon 17/intron 17 
boundary. 

3. A method as claimed in claim 2 wherein the mutation consists of the 
substitution of the G of the conserved GT pair for A. 

4. A method as claimed in any one of claims 1 to 3 wherein the sample of 
nucleic acid is amplified prior to analysis. 

5. A method as claimed in claim 4 wherein the nucleic acid is genomic 
DNA. 

6. A method as claimed in claim 5 wherein amplification is carried out 
using PCR and at least one pair of suitable primers. 

7. A method as claimed in claim 6 wherein the pair of suitable primers is: 

5'-GTA TTC ACA GAG ACT TGG CGG C-3'; and 
5'-AAA CCT GCA AGG AAA ATC CTT CAC GG-3'. 

8. A method as claimed in any one of claims 5 to 7 wherein after 
amplification the nucleic acid is treated with a restriction enzyme, followed by 
analysis of fragment lengths. 

9. A method as claimed in claim 8 wherein the nucleic acid is treated with 
the restriction enzyme NlaIII. 

10. A method as claimed in claim 8 or claim 9 wherein the ratio of 
restriction fragment lengths is determined. 

11. A method as claimed in claim 4 wherein the nucleic acid is mRNA. 

12. A method as claimed in claim 11 wherein the nucleic acid is amplified 
using RT-PCR. 

13. A method as claimed in claim 12 wherein the length of RT-PCR product 
is determined. 

14. A method for determining coat colour genotype in a pig which comprises the 
step of analysing a sample of pig KIT protein to determine whether the protein is the 
splice variant protein. 



WO 99/20795    PCT/GB98/03081 



15. A kit for use in determining the coat colour genotype of a pig which 
comprises one or more reagents suitable for determining whether a mutation is 
present at one or more exon/intron splice sites of the KIT gene. 

16. A kit as claimed in claim 15 which comprises one or more reagents for 
carrying out PCR and one or more pairs of suitable primers. 

17. A kit as claimed in claim 16 which comprises the following pair of 
primers: 

5'-GTA TTC ACA GAG ACT TGG CGG C-3'; and 
5'-AAA CCT GCA AGG AAA ATC CTT CAC GG-3'. 

Sequence Alignment Across the Exon/Intron Border of KIT Exon 17 

Allele  Gene Copy  Sequence 
                   Exon 17                     | Intron 17 
                                                 ↓ 
I       KIT1       AAT TAC GTG GTC AAA GGA AAC | GTG AGT ACC CAC GCT CTC CTG ACA GTC 
        KIT2                                   | A 
I^P     KIT1                                   | G 
        KIT2                                   | G 
i       KIT1                                   | G 

FIG. 3 

[FIG. 5: Ratio of normal to splice mutant KIT (plotted value: ratio)] 

[FIG. 6: Ratios for Landrace and Large White (x-axis: Breed, Landrace and Large White; plotted value: ratio)] 

[Plot of Ct FAM vs Ct TET for TaqMan based PCR analysis of porcine KIT splice mutation genotype] 